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Endian Transformation 

This invention relates to an endian transformation methocTand system. 

A problem coimnonly encountered by emulation systems, which run identical 
software on different computer processor chips is format incompatibility. One aspect 
of this incompatibility resides in the format in which strings of data (eg. 2-byte words 
or 4-byte words) are expressed. In many computer architectures, each byte of a 4-byte 
word has its own individual memory address; this gives rise to two possibilities for 
numbering the bytes within a word. In a big-endian convention, the word whose bytes 
are addressed (X, X+1, X+2 AND X+3) has its most significant byte addressed X, 
while in a little-endian convention, the address ordering is the reverse of this so that 
the least significant byte is addressed X and the most significant byte is addressed 
XH-3. Other "endian formats" are known in which different conventions are observed 
for ordering the significance of bytes within words, but in most commercially 
available systems either the big-endian or little-endian convention is observed. The 
ordering of bits within each byte is the same whether the memory address convention 
is big-endian or little-endian. 

Emulation systems are available which enable software (such as an operating 
system or an application program) of one endian format to operate on hardware of an 
opposite endian format. Generally, systems of this type convert each word between 
endian representations on a word-by-word basis. This conversion, when required 
firequently, introduces a significant overhead into the time required to perform a given 
task. 

It is an object of the present invention to provide an efficient method and 
system to enable software of one endian format to run on hardware of a different 
endian format. 

According to the invention there is provided a method for emulating a 
processor of a first type which observes a first convention for ordering the 
significance of bytes within words on a second type of processor which observes a 
second convention for ordering the significance of bytes within words, wherein 
memory access addresses are transformed such that bytes stored in a memory 
addressed by a processor of the second type as a result of an instruction in which a 
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byte order in accordance with the first convention is observed are distributed in a 
pattern which is a mirror image of the distribution pattern of the bytes which would 
result if the memory was addressed by a processor of the first type in response to the 
said instruction. 

The invention also provides a method for emulating a processor of a first type 
which observes a first convention for ordering the significance of bytes within words 
on a second type of processor which observes a second convention for ordering the 
significance of bytes within words, the order of the second convention bemg the 
reverse of the order of the first, wherein memory access addresses are transformed 
such that the offset between addresses of any two bytes stored in memory is unaltered 
by the transformation and the relative order of the addresses of any two bytes stored in 
the memory is reversed by the transformation. 

The invention fiirther provides a method for emulating a processor of a first 
type which observes a first convention for ordering the significance of bytes within 
words on a second type of processor which observes a second convention for ordering 
the significance of bytes within words, wherein memory access addresses are 
transfonned such that strings of bytes in the first endian format which are stored 
successively by the processor operating in accordance with the second endian format 
aggregate in the same manner as the bytes would aggregate if the processor was of the 
first endian format and memory access addresses were not transformed. 

The invention still fiirther provides a method for emulating a processor of a 
first type which observes a first convention for ordering the significance of bytes 
within words on a second type of processor which observes a second convention for 
ordering the significance of bytes within words, wherein each memory access address 
B of string length L is transfomied to the address A-B-L+S, wherein A is the total 
number of bytes allocated to a program, and S is the start address of the program. 

Assuming a big-endian processor and a little-endian program, the address 
transfonnations ensure that bytes aggregate in the memory in a pattern which is a 
mirror image of the pattern which would have resulted if the processor had been little- 
endian and no address transformation had been performed. The invention will operate 
in the same manner for a little-endian processor and a big-endian program. It is 
important to note that the transformation has no effect on the ordering of bits within 
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each byte. The result is a system which provides a considerable time saving when 
compared to known endian conversion methods, which convert each string of bytes 
between endian representations each time that string is used. ^' 

According to the invention there is provided an endian transformation system, 
the system comprising means for transfonning an address location of a code 
represented in a first endian format into an address in a second endian fonnat, the 
transformation comprising introducing an offset into the address, the size of the offset 
being determined from the difference between the address location of the code and a 
predefined address location. 

According to a further aspect of the present invention there is further provided 
a process for compiling or translating a computer program code instruction using 
transformed address space references in the compiled or translated code especially 
configured for execution on a programmable machine utilizing a corresponding 
predetermined convention for ordering the significance of bytes within words of said 
address space, said process comprising: 

(a) during compilation or translation of a code instruction referring to a 
memory address, transforming the referenced memory address with respect to a fixed 
block size of memory in the predetermined programmable machine so as to change 
the referenced address value by an amount that is fixed for a given number of bytes 
being accessed in each word; and 

(b) including the thus changed address reference in a compiled or 
translated output instruction so that there is no extra operation required during 
execution of the output instruction to accommodate the convention for ordering bytes 
within words used by said predetermined programmable machine. 

A specific embodiment of the invention will now be described by way of 
example only. 

The following two assembly code store instructions 
movl $Oxaabbccdd,[0] 
movl $0xll223344,[2] 
will have the following effect in a little-endian architecture: 
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1st Store 




2nd Store 
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The first store instruction stores the least significant byte (dd) of the first 4- 
byte word (aabbccdd) at address *0*, flie second least significant byte at address *r, 
etc. The second store instruction stores the least significant byte (44) of the second 4- 
byte word (11223344) at address *2', the second least significant byte at address *3', 
etc. Thus, the effect of storing the second 4-byte word is to overwrite the prior 
contents of addresses '2' and *3', and the two most significant bytes of the first 4-byte 
word *aa' and *bb' are lost. 

If the same two store instructions are used in a big-endian architecture, the 
most significant byte (aa) of the first 4.byte word will be stored at address '0\ the 
next most significant byte (bb) at address *r, etc. The second store instruction will 
overwrite the prior contents of addresses *2' and *3', as before, and the two least 
significant bytes of the first 4-byte word 'dd' and 'cc' will be lost. Thus, when a big- 
endian architecture is used, the contents of an addressed memory resulting fi-om 
assembly code store instructions will differ from the contents of an equivalent 
memory when a little-endian architecture is used. 

The invention allows both big-endian and little-endian words to be stored in 
such a way that any given store instruction will result in the same bytes being stored 
in both architectures, although the order of the bytes is reversed. This is achieved by 
transforming memory access addresses such that the pattern of bytes stored in a 
memory addressed by for example a big-endian processor is a mirror image of the 
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pattern which would have resulted if the memoiy had been addressed without 
transformation by a little-endian processor. 

In the case of the assembly code store instructions givBK above, in order to 
accommodate a big-endian architecture, the two 4-byte words are stored in accordance 
with the present invention at the uppermost available addresses as shown below: 



1st Store 




2nd Store 
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To preserve information in the memory, the second of the two stores places the 
second 4-byte word at a lower address than the first 4-byte word, thus overwriting the 
same 2-byte word (Oxaabb) of information as in the little-endian architecture. The 
effect in terms of the bytes stored in the memory of the assembly code instructions in 
the little-endian architecture is thus duplicated in the big-endian architecture, although 
the order of the bytes is reversed. 

The above exemplification of the system in accordance with the present 
invention in relation to an unaligned store instruction demonstrates the flexibility of 
the system. The system may also be used for aligned store instructions. 

The address transformations used as described above to preserve information 
in the big-endian architecture are: 



access type 


Adjustment 


word (4-byte) 


addr'=[20-addr] 


word (2-byte) 


addr'=[22-addr] 


byte 


addr'=[23-addr] 
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This generalises to: 



access type 


general adjustment 


where 


word (4-byte) 


addr'=endianAdj_L-addr 


EndianAdjJL=progSize-4 


word (2-byte) 


addr*=endianAdj_W-addr 


EndiaiiAdj_W=progSize-2 


byte 


addr'==endianAdj_B-addr 


EndianAdj_B=progSize- 1 



Thus, using the generalisation shown in the above table, the following 
operations in a little-endian architecture: 
movw $Oxaabb,[l] 
movl [l],%eax 

will have the same effect as the following operations in a big-endian architecture: 

movw $0xaabb,[22-l] 

movl[20-l],%eax 
The effect of the above commands is shown below: 



little-endian 
3rd Store 




big-endian 
3rd Store 
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The invention introduces one extra arithmetic operation for every load/store 
instruction. However, many instructions which access memory use address 
expressions which contain constant offsets such as: 

addl %edx,0x8(ebp,eax,4) 
which represents the effective address: 

ebp+eax*4+8. 
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This expression, after memory access transformation in accordance with the invention 
has been applied, becomes: 

endianAdj_L-(ebp+eax*4+8). ^ 
Folding the constants of the expression can be used to give: 

(endianAdj_L-8Hebp+eax*4). 
Thus, folding allows those teims which may be calculated at translation time to 
separated from those terms which are held in registers and are unknown at translation 
time. Since the term *endianAdj_L* is known at translation time its effect is 
calculated before run time, and the memory access transformation will not cause a 
loss of performance at run time. Thus, in general the big-endian transformation of the 
invention incurs no extra overhead for the majority of memory accesses. 

A subject machine program (or operating system) is treated as if it is loaded 
contiguously from address 0, while internally being stored as a mirror image, as 
shown below: 



Actual Memory 
Configuration 



Memory Configuration 
intended by the assembly code 
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If the assembly code specifies access to the 4-byte value 0x00000003 at 
memory location 3, using the same program size as the previous examples, this 
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memoiy access becomes endianAdj_L-3 = (progSize-4)-3 = (24-4)-3 = 17, which is 
the address in the big-endian mirror image of the value required. 

Whereas the above examples illustrate use of the invenKon in transforming 
code intended for a little-endian architecture so that it will run on a big-endian system, 
the invention could be used to transform big-endian code to run on a little-endian 
system. Indeed, the invention can be used to transform between any two endian 
systems which are byte reversals of one another. 

The endian transformation method may be used as part of a complete 
emulation system. 

The advantages joined by the "folding" operation described above are not 
limited to the particular transformation described. A similar operation may be 
performed in other compilation or translation processes using transformed address 
space references in the compiled or translated code to include the changed address in 
an output instruction to reduce overheads during execution of that output instruction. 
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Claims 

1. A method for emulating a processor of a first type wluch observes a first 
convention for ordering the significance of bytes within words on a second type of 
processor which observes a second convention for ordering the significance of bytes 
within words, wherein memory access addresses are transfonned such that bytes 
stored in a memory addressed by a processor of the second type as a result of an 
instruction in which a byte order in accordance with the first convention is observed 
are distributed in a pattern which is a mirror image of the distribution pattern of the 
bytes which would result if the memory was addressed by a processor of the first type 
in response to the said instruction, 

2. A method for emulating a processor of a first type which observes a first 
convention for ordering the significance of bytes within words on a second type of 
processor which observes a second convention for ordering the significance of bytes 
within words, the order of the second convention being the reverse of the order of the 
first, wherein memory access addresses are transformed such that the offset between 
addresses of any two bytes stored in memory is xmaltered by the transformation and 
the relative order of the addresses of any two bytes stored in the memory is reversed 
by the transformation. 

3. A metiiod for emulating a processor of a first type which observes a first 
convention for ordering the significance of bytes within words on a second type of 
processor which observes a second convention for ordering the significance of bytes 
within words, wherein memory access addresses are transformed such that strings of 
bytes in the first endian format which are stored successively by the processor 
operating in accordance with the second endian format aggregate in the same manner 
as the bytes would aggregate if the processor was of the first endian format and 
memory access addresses were not transfonned. 

4. A method for emulating a processor of a first type which observes a first 
convention for ordering the significance of bytes within words on a second type of 
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processor which observes a second convention for ordering the significance of bytes 
within words, wherein each memory access address B of string length L is 
transformed to the address A-B-L+S, wherein A is the total numBer of bytes allocated 
to a program, and S is the start address of the program. 

5. An endian transformation system, the system comprising means for 
transforming a memory access address in accordance with a method according to any 
preceding claim. 

6. A process for compiling or translating a computer program code instruction 
using transformed address space references in the compiled or translated code 
especially configured for execution on a programmable machine utilizing a 
corresponding predetermined convention for ordering the significance of bytes within 
words of said address space, said process comprising: 

(a) during compilation or translation of a code instruction referring to a 
memory address, transforming the referenced memory address with respect to a fixed 
block size of memory in the predetermined programmable machine so as to change 
the referenced address value by an amount that is fixed for a given number of bytes 
being accessed in each word; and 

(b) including the thus changed address reference in a compiled or 
translated output instruction so that there is no extra operation required during 
execution of the output instruction to acconamodate the convention for ordering bytes 
within words used by said predetermined programmable machine. 

7. A process according to claim 6, wherein said code is a computer program 
source code. 

8. A process as in claim 6 or claim 7, wherein said change causes said fixed 
block of memory to be addressed from a predetermined one of its two ends depending 
upon the convention utilized by said predetermined programmable machine for 
ordering the significance of bytes within words. 
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9. A process as in any one of claims 6 to 8, wherein said change causes the fixed 
block of memory contents for a big-endian machine to be inverted to the miiror image 
ofthat for a little-endian machine. 

1 0. A method of endian transformation substantially as hereinbefore described. 

11. A system for providing endian transformation substantially as hereinbefore 
described. 

12. A process for compiling or translating a complete program code substantially 
as hereinbefore descried with reference to the accompanying drawings. 
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preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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